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[0001] 



The present invention relates to microphone output signal 



levels, and more specifically, to the calibration thereof to a 
5 desired level. When output levels of different microphones are 

compared, it is assumed that the acoustical excitations thereof are 
identical. Manufacturers supply microphones having output levels 
varying around a specified mean value. For the often-used back- 
electret microphones, such tolerances are ±4 dB. Consequently, the 
10 output levels of such microphones may show a difference of up to 8 
dB. Microphones with tolerances of ±2 dB are sometimes available. 
These, however, are more expensive. 

Description Of The Related Art 

15 [0002] A usual approach for gain calibration of a microphone is 
carried out in an anechoic chamber, i.e., a chamber without 
reflections or reverberation. A loudspeaker is placed in front of 
the microphone (at an angle of 0°) inside the anechoic chamber. The 
loudspeaker plays a noise sequence at a known power level and the 

20 power of the microphone response is measured. Subsequently, an 
adjustable gain is set. 

[0003] Further an audio processing arrangement is disclosed in 
International Patent Application No. WO 99/27522. According to this 
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prior art reference, filtered sum and weighted sum beamforming are 
developed for maximizing power at the output. Filtered sum 
beamforming (FSB) makes the direct contributions maximally coherent 
upon adding thereof . 
5 [0004] With multi -microphone algorithms such as beamforming, it 
is very important to sort the microphones during production to 
obtain sets with level differences within the required tolerances. 
[0005] Moreover, with some multi -microphones systems, the 
consumer may buy additional microphones later in time, which will 
10 also have to be calibrated before installation. 

SUMMARY OF THE INVENTION 
[0006] The present invention provides a device for calibration 
of a microphone, comprising: 
15 [0007] a loudspeaker for converting a loudspeaker input signal 
into sound; 

[0008] a microphone for converting received sound into a 
microphone output signal, and 

[0009] calibration means for calibrating the output power of the 
20 microphone relative to a desired power level, said calibration 

means comprising impulse response estimating means for estimating 
an acoustic impulse response of the microphone and/or the 
environment at the microphone by correlating the microphone output 
signal and the loudspeaker input signal when the microphone 
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receives sound from the loudspeaker, whereby the output power of 
the microphone is estimated, 

[0010] As indicated above, calibration of microphones is often 
of crucial importance for good performance of multi -microphone 
5 systems. The present invention is concerned with the adaptive 
calibration (in software) of microphones under reverberant room 
conditions. An advantage of the present invention is that the 
microphones need not be selected or calibrated when manufacturing 
an audio system, saving production time and, sometimes, additional 

10 hardware. The present invention can be applied in all speech 
communication systems where one or more microphones and a 
loudspeaker are available. One can think of hands-free 
telecommunication systems, but also of hands- free speech 
recognition systems for voice control of, e.g., a television set. 

15 [0011] Non-uniformly ageing of microphones, which can also lead 
to output level differences, will also be neutralized by this 
invention. 

[0012] In a preferred embodiment of the invention, direct part 
removal means are provided for removing the direct part of the so- 

20 called acoustic impulse response (a.i.r.) in order to use, 

especially, the diffuse part of the a.i.r. An advantage hereof is 
that calibration can be executed during use in a normal 
environment, e.g., a room of a microphone, and without the need for 
additional hardware. Calibration during the actual use also allows 

25 for either absolute calibration or relative calibration. 



M: \UserPublic\GO\ST27GOA0.GOR 3 



PHNL 000363 

[0013] Another preferred embodiment comprises high- and low-pass 
filter means for filtering low and high frequencies, allowing for 
better calibration by using frequency ranges where signal quality 
is best suitable for processing. 
5 [0014] Another preferred embodiment comprises squaring and 

summation means for creating a representation of the current power 
level of the diffuse sound- field response of the microphone, in 
order to create a value that can be related to a desired level . 
[0015] The invention further preferably comprises relating means 
10 for relating the power level of the (diffuse) microphone response 
with a desired power level. 

[0016] Although it may be possible to obtain an absolute value 
for the desired power level, this desired power level is preferably 
available from a reference microphone. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0017] Further advantages, features, and details of the present 
invention will become clear when reading the following description 
with reference to the annexed drawings, in which: 
20 [0018] Fig. 1 is a perspective and partly diagrammatic view of a 
preferred embodiment of present invention in an audio conferencing 
system; 

[0019] Fig. 2 is a diagram of a prior art setting for 
calibration of a microphone in an anechoic chamber; 
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[0020] Fig. 3 shows graphs of a typical a.i.r. at 0° of a 
microphone and a corresponding energy decay curve (e.d.c.) as a 
function of time; 

[0021] Fig. 4 shows graphs of a typical a.i.r. at 180° on the 
5 same microphone as in Fig. 3, and the corresponding decay curve 
(e.d.c.) as a function of time; 

[0022] Fig. 5 is a diagram of adaptive microphone calibration as 
included in the embodiment of Fig. 1; 

[0023] Fig. 6 is a diagram of adaptive microphone calibration 
10 relative to a reference microphone which can also be used in the 
embodiment of Fig. 1; 

[0024] Fig. 7 is a diagram of relative calibration relative to 
reference microphone which can be also be used in the embodiment of 
Fig . 1 ; and 

15 [0025] Fig. 8 is a diagram of a band-pass filter and subsequent 
squaring and summation operation for use in the embodiments of 
Figs. 5-7. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
20 [0026] Fig. 1 shows an audio conferencing system comprising a 
main console 1 and one or two satellite microphones 2 for a larger 
pick-up range of speech, each satellite microphone containing a 
microphone. The audio conferencing system is connected to a floor 
unit 23, which, in turn, is connected to a power source 24 and a 
25 telephone network 25 of some kind, e.g., a PSTN (RJ11) or an ISDN 
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(RJ45) . The main console comprises, a loudspeaker for producing 
(voice) sounds, and three microphones for picking up (voice) sound. 
Furthermore, telephone means are included for making contact to 
other telephones through a telephone network. The microphones 
5 preferably inter-operate as seamlessly as possible. For this 

purpose, the invention provides means of eliminating the need of 
pre-installation calibration of the microphones in the satellite 
microphones or even of the microphones in the main console. 
[0027] Another example of use of a device according to present 

10 invention (not shown) , relates to voice-based commanding of a 

television set, e.g., for switching channels or controlling the 
volume, by using microphone input. This can also be embodied in a 
form with one or several microphones. In order for a system to use 
the microphone output signal, calibration may be necessary. 

15 [0028] For clarification, some acoustical concepts are explained 
that are relevant for understanding the detailed description of the 
drawings. Fig. 2 shows a room containing a loudspeaker 3 and a 
microphone 4 aimed towards that loudspeaker (thus at 0°) . 
[0029] An acoustic impulse response (a.i.r.) can be estimated 

20 from the loudspeaker excitation signal and the microphone response 
by correlation techniques. An a.i.r. is the response on an 
impulsive acoustic excitation. An example of such an estimated 
a.i.r. is depicted in Fig. 3. During the first few milliseconds, 
the response is zero due to the delay from the limited speed of 

25 sound in air. Next, a large peak can be observed, which is due to 
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the response to the direct acoustic propagation of the sound from 
the speaker towards the microphone, and is called the direct sound 
field contribution. This peak has a normalized value of 1.0. The 
tail relates to this value as depicted in this graph. The tail of 
5 the a.i.r. is due to reflections against room boundaries, and is 

called the diffuse sound field contribution. These reflections have 
a random character and increase statistically in density and 
decrease exponentially in amplitude over time. The combined effects 
of the reflections are called reverberation. 

10 [0030] An important function of the a.i.r. is the energy decay. 
In discrete time, with n the sample index, the energy decay at 
index n amounts to the energy left in the tail of the a.i.r. In 
Fig. 3, the so-called energy decay curve (e.d.c.) corresponding to 
a.i.r. is also logarithmically plotted. On the Y-axis, the quantity 

15 is measured in dB. The e.d.c. shows an abrupt change due to the 
direct component. The difference in energy decay just before and 
just after this jump is called the clarity index. A larger clarity 
index implies a larger direct /diffuse ratio, and thus, less 
reverberation. The envelope of the diffuse tail of the a.i.r. has 

20 an exponential decay which leads to the constant slope of the 

logarithm of the tail of the e.d.c. The reverberation time T60 is 
the time interval in which the reverberation level drops down by 60 
dB. It is found for this case that T60 = 0.36 s. 
[0031] Microphones can have unidirectional beam patterns. 

25 Unidirectional microphones only pick up acoustic signals from a 



M: \UserPublic\GO\ST27GOA0 .GOR 7 



PHNL 000363 

certain range of angles around 0°, i.e., they more or less block 
acoustic signals arriving at 180°. This means that the direct field 
contribution of an a.i.r. measured at 180° will be almost zero. 
[0032] In Fig. 4, the a.i.r. and the e.d.c. of the same 
5 (unidirectional) microphone as in Fig. 3, but now at 180°, are 

plotted. There also is a value normalized to one, yet only the tail 
is shown as this represents the diffuse response. By comparing Fig. 
3 and Fig. 4, it appears that at 180°, the direct contribution has 
vanished while the diffuse contribution has the same exponential 

10 envelope in both Figs. 

[0033] In the following, it is assumed that the energy in the 
diffuse tail of the a.i.r. does not depend on the microphone or 
loudspeaker orientation and location in the room. In practice, some 
variation are found depending on orientation and location, but 

15 these variations are small when the acoustic absorption pattern in 
the room is more or less homogenous and the reverberation over time 
is not to small (T60 > 100 ms) . It is worth mentioning that a 
typical room has a reverberation larger than 300ms. A general rule 
is that the bigger a room, the longer the reverberation time. 

20 [0034] The present invention uses, as input, not only the 
microphone response, but also the excitation signal of the 
loudspeaker (Fig. 2) . First, the a.i.r. is estimated from the 
loudspeaker to the microphone using a well-known correlation method 
in the estimating means. When acoustic cancellation is performed, 

25 this adaptive filter is already available. The diffuse part of the 
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a.i.r. is selected in the direct part removal means. At low 
frequencies, the loudspeaker output and/or the microphone 
sensitivity is low, which leads to unreliable a.i.r. coefficients. 
Therefore, a high-pass filter is applied to the diffuse part of the 
5 a.i.r. At the highest frequencies, near the Nyquist frequency, the 
signal levels will also be low due to anti-aliasing filters. Thus, 
to deal with unreliable a.i.r. coefficients at high frequencies, a 
low-pass filter is applied. 

[0035] In Fig. 5, these high- and low-pass filters are combined 

10 to form a band-pass filter. The filtered coefficients are squared 
and summed in the squaring and summation means, which leads to 
actual power level 14 representing the current power of the diffuse 
microphone response. This power level is related to a desired power 
level 20 and the gain factor is determined as the square root of 

15 the quotient of these power levels. 

[0036] In the preferred embodiment, this calibration method can 
be applied each time the adaptive filter comes up with a new 
estimation of the a.i.r. For increased robustness of an acoustic 
echo canceller, a programmable filter is sometimes used (as 

20 described in U.S. Patent 4,903,247). The adaptive filter runs in 
the background and the programmable filter, which takes its 
coefficients conditionally from the adaptive filter, is used for 
the actual echo removal. In this case, it is best to take the 
coefficients of the programmable filter and apply the calibration 

25 procedure after each coefficient transfer. 
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[0037] The loudspeaker 3 (Fig. 5) gets a loudspeaker input 
signal 5. Microphone 4 receives the sound that is being produced by 
the loudspeaker 3 and transforms this into microphone output signal 
6. Digital values of signals 5 and 6 are fed to estimator 7. The 
5 estimator 7 produces estimated values 9 that pass through to direct 
part removal part 8 embodied in software. From here, digital values 
10 are fed to digital band-pass filters 11. Signals 12 from these 
band-pass filters are fed to a squaring and summation program 13 . 
[0038] The estimated actual power level (P) 14 is fed to a 

10 relating program 15 as is an (external) desired power level (Q) 20. 
From here, the calibration gain factor 16 is fed to the averaging 
means 17. An adjusted calibration gain factor 18 is fed back to the 
microphone output signal in order to form the calibrated signal 19. 
[0039] Especially when combined with an adaptive filter for 

15 acoustic echo cancellation, the proposed microphone calibration 
method can be applied all the time that the system is active. In 
Fig. 5, the calibration factor, being the square root of the 
desired power level divided by the actual power level, is averaged 
to ensure that successive calibration gain factors will change 

20 smoothly. Such averaging can be done with a first -order recursion. 
This averaging procedure can also be applied to the actual power 14 
and the desired power 20 before the calculation of the square root 
of the desired power level divided by the actual power level . 
[0040] Below, the process of the embodiment of Fig. 5 is 

25 described. This preferred embodiment of the present invention 
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requires, as input, not only the microphone response 6 f but also 
the excitation signal 5 of the loudspeaker (Fig. 2) . First, the 
a.i.r. is estimated from the loudspeaker to the microphone using a 
correlation method in the estimating means 7. Only the diffuse part 
5 of the a.i.r. is selected in the direct part removal means 8. The 
band-pass filter 11 is used for filtering out high and low 
frequencies. The filtered coefficients are squared and summed in 
the squaring and summation means 13, which leads to actual power 
level 14 representing the current power of the diffuse microphone 
10 response. This power level is related to a desired power level 20, 
and the gain factor is determined as the square root of the desired 
power level divided by the actual power level . 

[0041] Fig. 6 shows the same configuration as Fig. 5 except for 
the averaging means 17 and relating program 15. This configuration 
15 is used in case of referential calibration for the reference 

microphone, whereby the desired power level 20 is input for the 
relating means 15 of the other microphones calibration means using 
the reference microphone as their reference. 

[0042] Fig. 7 shows how the building blocks of Fig. 5 and 6 can 
20 be combined for referential calibration for use in, e.g., an audio 
conferencing system as in Fig. 1. 

[0043] Fig. 8 shows, graphically, how the averaging algorithm 
would work in calculating the power P of a diffuse sound- field 
response of a microphone. The scheme consists of a band-pass filter 
25 followed by summation of the squared output values. At a sampling 
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rate of 8 kHz, good filter parameters, leading to low-pass and 
high-pass cutoff frequencies (-3 dB) of about 200 Hz and 3.6 kHz, 
respectively, are b=0.800, al=0.128, and a2=0.621. 
[0044] The present invention is not limited to the above 
5 preferred embodiments; the rights applied for are defined in the 
annexed claims . 
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